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METHOD FOR CLONING AND EXPRESSION OF SBFI 
RESTRICTION ENDONUCLEASE AND SBFI METHYLASE IN 

E.COLI 

5 BACKGROUND OF THE INVENTION 

Present embodiments of the invention relate to 
recombinant DNA that encodes the Sbfl restriction endonuclease 
(Sbfl endonuclease; R.Sbfl) as well as the Sbfl modification 
10 methyltransferase (Sbfl methyltransferase; M.Sbfl), and to the 

expression of the Sbfl endonuclease and methyltransferase in E. 
coli cells that contain the recombinant DNA. 

Restriction endonucleases are enzymes that occur 
15 naturally in certain unicellular microbes— mainly bacteria and 

archaea— and that function to protect these organisms from 
infections by viruses and other parasitic DNA elements. 
Restriction endonucleases bind to specific sequences of 
nucleotides ('recognition sequence') in double-stranded DNA 
20 molecules (dsDNA) and cleave the DNA, usually within or close to 

the sequence, disrupting the DNA and triggering its destruction. 
Restriction endonucleases commonly occur with one or more 
companion enzymes termed modification methyltransferases. 
Methyltransferases bind to the same sequences in dsDNA as the 
25 restriction endonucleases they accompany, but instead of 

cleaving the DNA, they alter it by the addition of a methyl group 
to one of the bases within the sequence. This methylation 
('modification 1 ) prevents the restriction endonuclease from 
binding to that site thereafter, rendering the site resistant to 
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cleavage. Methyltransferases function as cellular antidotes to 
the restriction endonucleases they accompany, protecting the 
cell's own DNA from destruction by its restriction endonucleases. 
Together, a restriction endonuclease and its companion 
5 modification methyltransferase(s) form a restriction-modification 

(R-M) system, an enzymatic partnership that accomplishes for 
microbes what the immune system accomplishes, in some 
respects, for multicellular organisms. 

A large and varied class of restriction endonucleases have 
been classified as 'Type II' restriction endonucleases. These 
enzymes cleave DNA at defined positions, and in purified form 
can be used to cut DNA molecules into precise fragments for 
gene cloning and analysis. The biochemical precision of Type II 
restriction endonucleases far exceeds anything achievable by 
chemical methods, making these enzymes the reagents sine qua 
non of molecular biology laboratories. In this capacity, as 
molecular tools for gene dissection, Type II restriction 
endonucleases have had a profound impact on the life sciences 
in the past 25 years, transforming the academic and commercial 
arenas, alike. Their utility has spurred a continuous search for 
new restriction endonucleases, and a large number have been 
found. Today more than 200 Type II endonucleases are known, 
each possessing different DNA cleavage characteristics (Roberts 
and Macelis, Nucl. Acids Res.29: 268-269 (2001)). (RE BASE®, 
http://rebase.neb.com/rebase) . Concomittantly, the production 
and purification of these enzymes has been improved by the 
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cloning and over-expression of the genes that encode them in 
non-natural production strain host cells such as E. colL 

Since the various restriction enzymes appear to perform 
similar biological roles, in much the same ways, it might be 
thought that they would resemble one another closely in amino 
acid sequence and behavior. Experience shows this not to be 
true, however. Surprisingly, far from resembling one another, 
most Type II restriction enzymes appear unique, resembling 
neither other restriction enzymes nor any other known kind of 
protein. Type II restriction endonucleases seem to have arisen 
independently of one another for the most part during evolution, 
and to have done so hundreds of times, so that today's 
enzymes represent a heterogeneous collection rather than a 
discrete family. Some restriction endonucleases act as 
homodimers, some as monomers, others as heterodimers. Some 
bind symmetric sequences, others asymmetric sequences; some 
bind continuous sequences, others discontinuous sequences; 
some bind unique sequences, others multiple sequences. Some 
are accompanied by a single methyltransferase, others by two, 
and yet others by none at all. When two methyltransferases are 
present, sometimes they are separate proteins, at other times 
they are fused. The orders and orientations of restriction and 
modification genes vary, with all possible organizations 
occurring. Several kinds of methyltransferases exist, some 
methylating adenines (m6A-MMases), others methylating 
cytosines at the N-4 position (m4C-MMases), or at the 5 position 
(m5C-MMases). Usually there is no way of predicting, a priori, 
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which modifications will block a particular restriction 
endonuclease, which kind(s) of methyltransferases(s) will 
accompany that restriction endonuclease in any specific instance, 
nor what their gene orders or orientations will be. 

From the point of view of cloning a Type II restriction 
endonuclease, the great variability that exists among restriction- 
modification systems means that, for experimental purposes, 
each is unique. Each enzyme is unique in amino acid sequence 
and catalytic behavior; each occurs in unique enzymatic 
association, adapted to unique microbial circumstances; and 
each presents the experimenter with a unique challenge. 
Sometimes a restriction endonuclease can be cloned and over- 
expressed in a straightforward manner but more often than not 
it cannot, and what works well for one enzyme can work not at 
all for the next. Success with one is no guarantee of success 
with another. 

SUMMARY OF THE INVENTION 

20 

In an embodiment of the invention, an isolated DNA 
encoding the Sbfl restriction endonuclease is provided where the 
isolated DNA is obtainable from Streptomyces species Bf-61 . 

25 In an additional embodiment of the invention, a 

recombinant DNA vector is provided that includes a vector into 
which a DNA segment encoding the Sbfl restriction endonuclease 
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has been inserted. A host cell transformed by the recombinant 
vector is further provided. 

In an additional embodiment of the invention, an isolated 
DNA encoding the Sbfl restriction endonuclease and Sbfl 
methylase is provided where the isolated DNA is obtainable from 
ATCC No. PTA-5371. A vector that includes this isolated DNA and 
a host cell transformed by the vector is further provided. 

In an additional embodiment of the invention, a method is 
provided for producing recombinant Sbfl restriction endonuclease 
that includes culturing a host cell transformed with any of the 
vectors described above under conditions suitable for expression 
of the endonuclease. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Gene organization of the Sbfl R-M system. 
SbflR, Sbfl restriction endonuclease gene; SbflM, Sbfl methylase 
gene. 

Figure 2. The Sbfl methylase gene sequence (SbflM, 
1460 bp) (SEQ ID NO:l) and the encoded amino acid sequence 
(SEQIDNO:2). 

Figure 3. The Sbfl endonuclease gene sequence (SbfIR, 
971 bp) (SEQ ID NO:3) and the encoded amino acid sequence 
(SEQ ID NO:4). 
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Figure 4. A plasmid map of pACYC184-PstIM clone. 



Figure 5. A plasmid map of pACYC184-SbfIM clone. 



Figure 6. A plasmid map of pCAB16. 



Figure 7. A plasmid map of pLT7K-SbfIR endonuclease 

clone. 



Figure 8. Recombinant Sbfl endonuclease activity in cell 
extract. I DNA was used as the substrate. Lanes 2-8, lx, 1/2, 
1/4, 1/8,1/16,1/32,1/64 diluted cell extract added in the 
restriction digestions. Lane 1, I DNA digested with native Sbfl. 



DETAILED DESCRIPTION OF THE INVENTION 



The Sbfl endonuclease and methyltransferase are 
enzymes that occur in the bacterium Streptomyces species Bf-61 
(New England Biolabs' strain collection). The Sbfl endonuclease 
binds to the symmetric nucleotide (nt) sequence 5'-CCTGCAGG-3' 
in double-stranded DNA molecules (dsDNA) and cleaves the DNA 
between the A and G in each strand thus: 5'-CCTGCA/GG-3', 
producing DNA fragments with 4-nt cohesive ends (/ indicates 
the position of strand-cleavage). Many restriction endonucleases 
that occur in nature are accompanied by protective modification 
methyltransferases. However, restriction endonucleases that 
recognize and cleave long, infrequently occurring sequences 
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such as R.Sbfl, are not always accompanied by a protective 
methyltransferase. At the outset of these experiments it was not 
known whether a modification methyltransferase in fact 
accompanied R.Sbfl. 

5 

Hurdles in Cloning Sbfl restriction endonuclease 

(a) Unsuccessful methylase selection 
The methylase selection method described in U.S. Patent 

No. 5,200,333 is the preferred method for cloning restriction- 
modification systems. Methylase selection did not however yield 
the Sbfl methylase gene (M.Sbfl). The reasons for failure could 
include any or all of the following technical difficulties: 
construction of initial libraries where potential cloning sites may 
cut within the methylase gene; failure to clone the proper 
fragment from the libraries due to size of the DNA fragment; 
gene toxicity; low expression of the methylase gene, or the 
complete absence of the accompanying methylase gene, as with 
certain other 8-nucleotide specific restriction endonucleases such 
as Pad 5-TTAAT/TAA- 3 ' (U.S. Patent No. 5,098,839). 

(b) Absence of SbflM downstream of SbfIR 
Although the possibility existed that there was no M.Sbfl, 

alternative approaches to cloning the gene were attempted. 
25 Subsequently an Sbfl methyltransferase was indeed identified 

by a PCR reaction on the original pUC19-Sau3AI partial Sbfl DNA 
library that failed to yield a selectable sbflM gene. The PCR 
reaction showed the presence of a Sau3Al partial fragment that 
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contained the sbflM region. The location of the sbflM region on 
chromosomal DNA was eventually determined by inverse PCR to 
be not downstream of the sbfIR gene as first thought but 
upstream of the sbfIR gene. The sequence of sbflM was 
subsequently obtained. 

The Sbfl methylase gene was cloned by PCR from S. Bf-61 
chromosomal DNA into pACYC184 and transformed into E. colt. 
The cloned Sbfl methyltransferase bound to the same nucleotide 
(nt) sequence in dsDNA as R.Sbfl, and catalyzed the addition of 
a methyl group to the adenine residue in each strand (5'- 
CCTGCmAGG-3 , )/ producing modified DNA molecules that was 
resistant to cleavage by R.Sbfl (mA indicates the modified base). 

The Sbfl restriction endonuclease gene (sbfIR) was 
identified using PCR primers based on the Sbfl endonuclease 
(R.Sbfl) amino acid sequence from the N-terminal end and the 
cyanogen bromide-digested ends of the protein. The gene was 
subsequently cloned by inverse PCR from Sbfl chromosomal DNA. 

(c) Inconclusive Expression Levels for Endonuclease 

The PCR fragment obtained by inverse PCR containing the 
sbfIR gene was inserted into the plasmid vectors pRRS and 
pLT7K and used to transform M.PstI pre-modified E. colt. M.PstI 
premodification was used because M.Sbfl had not yet been 
identified and Pstl methylase (M.PstI) was shown to protect Sbfl 
sites in addition to Pstl sites against both Pstl and Sbfl 
endonuclease digestion of purified Pstl methylated DNA. 
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No clones with inserts were found with pRRS; some clones 
were found with pLT7K where the regulated T7 expression 
vector pLT7K contained a constitutive anti-sense promoter 
downstream of sbfIR to reduce basal expression. Some pLT7K 
clones contained correct DNA sequence, however these 
produced very low Sbfl endonuclease activity upon induction. 
This negative result suggested there was selection pressure to 
isolate endonuclease mutants with reduced activity. This 
disappointing result suggested that highly expressing clones 
were being selected against perhaps because of under- 
methylation of the host chromosome DNA by M.Pstl. 

Since expression from a medium-copy-number T7 vector in 
E. coli pre-modified with M.PstI did not generate a stable high 
expression clone, efforts were made to express the sbfIR gene 
in M.Sbfl methylated E.coli using pLT7K. When the Sbfl 
endonuclease gene was cloned in M.Sbfl pre-modified E. coli, a 
stable and over-expressing clone was established. Over- 
expression of an enzyme is generally intended to mean at least 
10 5 ug including 10 6 or 10 7 ug. Low expression is less than 10 3 ug. 
Low expression levels of a putative cloned restriction 
endonuclease may result in cleavage profiles of DNA. However, 
this does not conclusively prove that the desired enzyme has 
been obtained. For example, the enzyme digest may be partial 
or incomplete making it unclear whether the products are merely 
the result of random cleavage. 
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(d) Obtaining an over-expressing clone 

A stable over-expressing clone of R.Sbfl was obtained as 
follows: The sbfIR gene was amplified by PCR from genomic DNA. 
Following purification, the resulting PCR fragment was blunt-end 
ligated into pCAB16 at a BsaAI site. pCAB16 is a pUC18 
derivative containing the mspIR gene in the polylinker of pUC18 
in line with the Plac promoter. pCAB16 contains a single BsaAI 
site within the mspIR gene. Insertions at this site interrupt 
mspIR expression (which would otherwise be lethal) enabling 
plasmids containing inserts to be selectively recovered with high 
efficiency (Figure 6). The sbfIR PCR-fragment was ligated into the 
BsaAI site of pCAB16, and transformed into M.Pstl pre-modified 
E. coli. Clones that were found to carry the sbfIR PCR insert were 
cultured. However, assays showed these clones had no 
detectable Sbfl endonuclease activity. DNA sequencing of these 
clones showed that an intact sbfIR gene was present in the 
opposite orientation to Plac and mspIR, which could explain the 
lack of R.Sbfl activity. 

20 Taking an alternative approach, the sbfIR gene was 

purified from the pCAB16-SbfIR plasmid by gel purification. The 
resulting DNA fragment was ligated into pl_T7K and transformed 
into M.PstI pre-modified E. coli. Clones found to carry the PCR 
insert were induced with IPTG and assayed for Sbfl activity on 

25 X DNA. The extracts generated partial Sbfl digestion pattern. DNA 

sequence of one of these clones showed that it carried an intact 
sbfIR gene. Since the DNA sequence was correct for this pLT7K- 
sbflR plasmid, most likely highly expressing clones were being 
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selected against during induction. When the sbflM gene was 
subsequently identified and cloned into pACYC184, this same 
pLT7K-SbfIR clone was then transformed into M.Sbfl pre-modified 
E. coli. Transformants in which the sbflM gene was expressed 
from a low-copy-number plasmid pACYC184 and the sbfIR gene 
was expressed using the medium-copy-number pLT7K within the 
same E. coli host were cultured and their cells extracts were 
assayed for Sbfl activity on X DNA. The recombinant SbfL 
endonuclease yield was ~10 5 units/g of wet cells from the over- 
producing strain. 

In summary, an expression strategy was ultimately 
developed which overcame a number of hurdles and ultimately 
proved successful in yielding over-expressed R.Sbfl. This 
strategy relied in one embodiment on expressing the R.Sbfl and 
M.Sbfl under different strength promoters namely a medium copy 
number promoter and a low copy number promoter respectively. 
However, in alternative embodiments, both sbflM and sbfIR 
genes may be expressed under the same promoter. 

Moreover, the sbflM gene may be cloned in a single 
plasmid together with the sbfIR gene under the same or 
different promoters or in seperate plasmids under the same or 
different promoters. 

The method described herein by which the sbflM and sbfIR 
genes are preferably cloned and expressed in E. coli include the 
following steps: 
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Preparation of genomic DNA and construction of Sbfl genomic 
DNA library 

Genomic DNA was prepared from Streptomyces species Bf- 
61 by the 2X Kirby method ((Hopwood et al. Genetic Manipulation 
of Streptomyces. A Laboratory Manual. John Innes Foundation, 
Norwich, p. 77 (1985)). 

Partially digested genomic DNA preparations were ligated 
to a digested, CIP-treated pUC19 vector into which two Sbfl 
sites had been previously engineered. The ligated DNA mixtures 
were used to transform E. coli. Transformants from each library 
were pooled and amplified, and plasmid DNA was prepared to 
generate primary plasmid libraries. 

Methylase-selection 

The primary plasmid libraries were challenged by digestion 
with Sbfl. These DNA digests were then transformed back into E. 
coli and plasmid DNA was prepared from some of these initial the 
survivors of each selected primary library. Sbfl digestion of these 
indicated that none were resistant to digestion, suggesting that 
none carried the sbflM gene. Remaining surviving colonies from 
some of the challenged primary libraries were also pooled to 
form a secondary library, and challenged a second time with Sbfl. 
Plasmid DNA of these survivors again showed no resistance to 
Sbfl endonuclease digestion. 
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The non-recombinant Sbfl endonuclease was purified-and 
the N-terminal amino acid sequence of this protein was 
determined along with the amino acid sequence from 
cyanobromide digested R.Sbfl fragments. Based on these amino 
acid sequences, converging sets of degenerate and non- 
degenerate primers were synthesized. These were used to 
prime PCR reactions on Sbfl chromosomal DNA generating DNA 
fragments containing the 5' end of the sbfIR gene. DNA 
sequencing revealed an open reading frame (ORF) of 555 bp 
that had extensive homology to the R.PstI endonuclease and to 
its isoschizomers, BsuBI and Xphl, The Pstl recognition 
sequence, 5'-CTGCA/G-3', is encompassed by the Sbfl 
recognition sequence, 5'-CCTGCA/GG-3\ These similarities in 
recognition sequences and amino acid sequences strongly 
suggested that the 555 bp ORF comprised the 5' end of sbfIR 
gene. 

Inverse PCR amplification of DNA downstream of the 5' end 
20 Sbfl endonuclease gene 

Following cloning and identification of the N-terminal 
portion of the sbfIR endonuclease gene, efforts were made to 
clone 3' end of sbfIR and the adjacent downstream DNA. 

25 

Genomic DNA was digested with multiple restriction 
enzymes and self-ligated. The resulting circular DNA molecules 
were used as templates for inverse PCR. The DNA sequence at 
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the N-terminus of the sbfIR gene was used to design primers for 
the inverse PCR of Sbfl chromosomal DNA. 

Not all inverse PCR reactions of self-ligated genomic DNA 
generated inverse PCR fragments. Only the Hindi and 
AYpyCH4HIV templates produced PCR fragments that could be 
purified for cloning. The Hindi- and Hp/CH4HIV- inverse PCR 
fragments were ligated into pUC19 and transformed into E. coli. 
Clones with PCR inserts were sequenced directly with pUC19 
universal primers. 

From this DNA sequence near the newly found Hindi site, a 
new primer was designed and was used in a PCR reaction with a 
primer from the N-terminal Sbfl endonuclease sequence to 
generate a fragment that linked the N-terminal sequence to the 
inverse PCR sequence downstream of 5' sbfIR. The PCR fragment 
was purified, digested and ligated into pUC19. This ligated DNA 
was transformed into E. coli and clones with PCR inserts were 
sequenced directly with pUC19 universal primers. 

The Hindi and HpyCHAHlV PCR overlapping fragments 
generated ~900 bp, and ~500 bp of new sequences, 
respectively. Combining these sequences with the 5' sbfIR 
sequence, a complete ORF of 969 bp was found, most likely 
representing the sbfIR restriction gene. An additional ~700 bp of 
sequenced Sbfl chromosomal DNA that was downstream of sbfIR 
was compared to the known gene products in GenBank using 
BLAST and did not appear to contain the Sbfl methylase gene. 
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Inverse PCR amplification of DNA upstream of Sbfl 
endonuclease and identification of Sbfl methylase 

Unable to isolate the sbflM gene by methylase selection, 
combined with the inability to identify the sbflM gene 
downstream of the Sbfl endonuclease, efforts were made to 
clone DNA upstream of the sbflR gene. 

PCR reactions were done on the Sbfl primary pUC19 
libraries used in the initial methylase selection with two 
converging primers within the sbflR gene. PCR showed that the 
Sbfl primary libraries contained at least a portion of the sbflR 
gene. To identify any larger upstream DNA fragments that might 
be contained in the initial primary libraries, a second PCR was 
done on the libraries with pUC19 universal primers and a primer 
designed from within the sbflR gene oriented toward the 
upstream or 5' end of sbflR. These atypical PCR reactions 
generated fragments from the Bgfll, Bell and Sau3Al primary 
libraries that were larger than 1.6 kb and potentially large 
enough to contain upstream DNA for the sbflM gene. These 
library generated PCR fragments were purified and DNA 
sequenced directly using the PCR primers. 

The only readable sequence came from the Sbfl Sau3Al 
library PCR fragment. This DNA sequence was used to design a 
set of new PCR primers to be used in PCR with a converging 
primer from within the sbflR gene to hopefully directly clone the 
DNA upstream relative to the sbflR gene from Sbfl chromosomal 
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DNA. These PCR fragments were purified, digested, and ligated 
into pUC19 with compatible ends, followed by transformation 
into E. coli. Colony PCR identified PCR inserts and clones 
containing about the correct size fragment were further purified 
by CsCI purification and sequenced with pUC19 universal and 
custom primers. 

DNA sequencing revealed a new upstream ORF, which was 
determined to be the 3' end of the sbflM gene, which was 
coupled and arranged in head-to-tail fashion to the sbfIR gene. 
The sbflM transcriptional (TGA) stop codon was found to overlap 
the GTG start codon for the Sbfl endonulease gene. DNA 
sequencing across the junction revealed that along with the GTG 
start for the sbfIR gene, an additional translated amino acid, 
serine (S), is present making R.Sbfl one amino acid longer than 
predicted from N-terminal amino acid sequence. In fact R.Sbfl is 
actually 323 amino acids in length, not 322 amino acids. With the 
additional serine, the full sbfIR gene is 971 bp. (Figure 3). 

Unpredictably during cloning, due to a designed Hindlll site 
in the PCR primer, DNA sequencing revealed an unknown Hindlll 
site contained within the sbflM gene. This truncated the PCR 
fragment during cloning and in effect cut off the 5' end of the 
sbflM gene. Efforts were then made to clone 5' end of sbflM 
using inverse PCR with new primers designed from within the 
truncated sbflM gene. Inverse PCR was performed on genomic 
DNA digested with Hindi and self-ligated. The resulting PCR 
fragments were purified, ligated into pCAB16 and transformed 
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into E. coli. Plasmid DNAs were purified and sequenced. 
Combining this DNA sequence with the truncated 3' sbflM gene 
DNA sequence, the full length sbflM gene of 1460 bp was 
revealed, which encodes a translated Sbfl methylase of 496 
5 amino acids (Figure 2). 

Isolation of intact SbfIR gene in E.coli 

Since the sbflM gene was not isolated initially, the 
10 cloning/expression strategy was to use M.Pstl to pre-modify E. 

coli by expressing the pstIM gene in a low copy-number plasmid, 
PACYC184, and the sbfIR gene in either a high copy-number 
constitutive vector pRRS, or a medium-copy-number, regulated 
vector, pLT7K. 

15 

The sbfIR gene was amplified from genomic DNA by PCR 
with Deep Vent® DNA polymerase. Following purification and 
digestion the PCR fragment was ligated into pRRS and pLT7K, 
respectively, digested with the same enzymes to create 

20 compatible ends. The pRRS-SbfIR ligation was transformed into 

M.Pstl pre-modified E. coli and plasmids were purified and 
screened for inserts. No pRRS-SbfIR clones were found. The 
pLT7K-SbfIR ligation was transformed into E. coli ER2502 (lacking 
T7 RNA polymerase). Positive clones were identified and these 

25 were then transferred into a M.Pstl E. coli ER2744 (containing 

the T7 RNA polymerase). Cell cultures were made from individual 
pLT7K-SbfIR transformants and induced with IPTG. Cell extracts 
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were prepared and assayed for Sbfl endonuclease activity and 
none produced detectable Sbfl endonuclease. 

In another attempt, a larger PCR fragment containing the 
sbfIR gene plus 600 bp of downstream DNA (~1500 bp), was 
digested and ligated into pLT7K and transformed into E. coll. 
Positive clones were identified and then transferred into M.Pstl 
pre-modified E. coll ER2744 and induced with IPTG, assayed for 
Sbfl activity, yielding no detectable activity. This same pLT7K- 
SbflR ligation of the sbfIR gene, including 600 bp of downstream 
DNA, was also directly transformed into M.Pstl pre-modified E. 
coli ER2744. Positive pLT7K-SbfIR clones were identified by PCR, 
cultured, induced with IPTG and then assayed again yielding no 
detectable Sbfl activity. This negative result indicated there was 
selection pressure to isolate endonuclease mutants. 

In order to isolate a clone containing the sbfIR gene with 
the correct DNA sequence from the native Streptomyces species 
Bf-61 strain, the sbfIR gene PCR was blunt-end ligated into 
20 pCAB16 at the SsaAI site followed by tranformation into M.Pstl 

pre-modified E. coli. Clones were found to carry the sbfIR gene 
fragment, and cultured, and assayed for Sbfl activity on k DNA. 
The extracts generated no Sbfl digestion pattern, however, DNA 
sequence from these clones showed an intact sbfIR gene in the 
25 opposite orientation to Plac and mspIR. 
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Expression of SbfIR gene in E.coli 

The sbfIR gene was gel purified from the pCAB16-SbfIR 
plasmid, ligated into pLT7K and transformed into M.Psrl pre- 
modified E. coli. Clones were found to carry the sbfIR gene and 
assayed for Sbfl activity on X DNA. The extracts generated partial 
Sbfl digestion pattern. DNA sequencing of pLT7K-SbfIR clones 
showed that each carried an intact sbfIR gene. Since the DNA 
sequence was correct for the pLT7K-SbfIR #12 plasmid, when 
the sbflM gene was identified and cloned into pACYC184, the 
pl_T7K- SbfIR #12 clone was then transformed into M.Sbfl pre- 
modified E. coli. All transformants were cultured, induced with 
IPTG and then assayed for Sbfl activity on X DNA yielding ~10 5 
units/g of wet cells from the over-producing strain. 

The present invention is further illustrated by the following 
Examples. These Examples are provided to aid in the 
understanding of the invention and are not construed as a 
limitation thereof. 

All references cited above and below are herein 
incorporated by reference. 



Docket No.: NEB-232 

-20- 
EXAMPLE 1 

Cloning of Sbfl restriction-modification system in 

E. coli 

1. Preparation of genomic DNA 

Genomic DNA was prepared from 2g of Streptomyces 
species Bf-61, by the following steps: 

a. Cell wall digestion by addition of lysozyme (2 mg/ml 
final), sucrose (1% final), and 50 mM Tris-HCI, pH 8,0. 

b. Cell lysis by addition of 8 ml of 2X Kirby mixture: (2g 
15 Sodium tri-isopropylnaphthalene sulphonate, 12g ^amino- 
salicylate, 10ml 1M Tris-HCI pH8, 6 ml phenol saturated with 
50mM Tris-HCI pH 8.0 made up in 100 ml dH 2 0) Vortex 1 minute. 

c. Removal of proteins by phenol-CHCI 3 extraction of 
DNA 2 times (equal volume). 

20 d. Dialysis in 4 liters of TE buffer, buffer change four 

times. 

e. RNase A treatment to remove RNA. 

f. Genomic DNA precipitation in 0.4M NaCI and 0.55 
volume of 100% isopropanol, spooled, dried and resuspended in 

25 TE buffer. 

2. Restriction digestion of genomic DNA and 
construction of genomic DNA library 
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Varying units of restriction enzymes Sau3Al, Bell, BstYl, 
Bglll, Xbal and Nhel were used to digest 10 mg genomic DNA to 
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achieve complete and limited partial digestion. The digested DNA 
was purified via phenol-CHCI 3 extraction and isopropanol 
precipitation. The Sau3AI-, Bell-, BstYl-, and Bg/ll-, digested DNAs 
were ligated to BamHl- digested and CIP-treated pUC19 vector 
containing 2 Sbfl sites. Xbal- and Nhel-, digested DNAs were 
ligated to the same vector Xbal- digested and CIP treated. 
Following overnight ligation, the DNA was used to transform an 
endA" host (ER2502, ER2683 New England Biolabs' collection 
(Beverly, MA)), made competent by CaCI 2 method. Approximately 
2-5,000 Ap R transformants were obtained from each library. For 
each enzyme, colonies were pooled and amplified in 500ml LB + 
Ap overnight. Plasmid DNA was prepared by CsCI gradient 
purification, resulting in a primary library. 

15 3. Attempt to Clone SbflM gene by methylase 

selection 

The primary plasmid DNA library (1 mg DNA) was 
challenged by digestion with ~30 units of Sbfl at 37°C for 1 hour. 

20 The digested DNA was transferred into ER2502 or ER2683 by 

transformation, resulting in ~750 Ap R survivors from all libraries. 
Plasmid DNA from ~120 survivors was prepared by the Compass 
Mini Plasmid Kit method, followed by Sbfl digestion. No resistant 
clones were found from any of the libraries. Some remaining 

25 survivors (Sai73AI, estYI, Xbal and Nhel) were also pooled 

separately to form secondary libraries, challenged with Sbfl a 
second time, followed by the same survivor Plasmid DNA 
purification, again no resistant clones were found. 



5 
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4. Identification of the Sbfl endonuclease 

The non-recombinant Sbfl endonuclease was purified to 
5 near homogeneity and the purified protein was subjected to 

SDS-PAGE. A protein band of ~36 kDa was detected. The N- 
terminal amino acid sequence was determined as (MN)SDGIDGTV 
ASIDTARALLKRFGFDAQRYNV (SEQ ID NO: 5). The 36 kDa protein 
was digested with cyanogen bromide and a 4.5 kDa fragment 
10 amino acid sequence was determined as (M)VEEFVPRFAPRSTV 

LYLGDTRGKHSLFEEEI (SEQ ID NO:6). Using these two amino acid 
sequences, converging sets of degenerate and non-degenerate 
primers were designed to PCR the beginning of the sbfIR gene 
from chromosomal DNA. 

15 

Two sets primers were synthesized with the following 
sequences: 

S'atgaactccgacggcatcgacS' sbfN-1 
20 (SEQ ID NO:7) 

5'aanacyaartcnaccat3' sbf45a 
(SEQ ID NO:8) 

25 5'atgaacagcgacggcatcgac3' sbfN-lb 

(SEQ ID NO:9) 

5'aasaccaactcctcsaccat3' sbf45a-2 
(SEQ ID NO:10) 

30 

The primers were used in two separate PCR reactions: 
sbfN-1 + sbf45a, sbfN-lb + sbf45a-2. PCR conditions were 95°C 
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for 5 min, 1 cycle; 95°C for 1 min, 54°C for 1 min, 72°C 1 min for 
25 cycles with Deep Vent® DNA polymerase. The PCR of Sbfl 
chromosomal DNA with the above primers generated ~550 bp 
DNA fragment. The PCR fragments were gel purified, phenol- 
CH 3 CI extracted and isopropanol precipitated. The resuspended 
PCR fragment was blunt-end ligated into pCAB16 at the BsaAI 
site followed by transformation into E.coli ER2502. pCAB16 
clones with PCR inserts were sequenced using the following 
sequencing primers: 

5'ggagccatacagagagcgatttattcg3' 167 
(SEQ ID NO:ll) 

5'ttgaaatcgaattaataagtctggatg3' 168 
(SEQ ID NO:12) 

DNA sequencing identified an open reading frame (ORF) of 
555 bp DNA fragment containing the 5' end of the sbfIR gene. 

5. Inverse PCR amplification of DNA downstream of 
the 5' end Sbfl endonuclease gene 

After identification of the N-terminus of the endonuclease 
gene, efforts were made to clone adjacent downstream DNA. 
DNA sequence at the N-terminus of the sbfIR gene was used as 
the template for primer design. 

Four primers were synthesized: 

5'ccagtccatgatcttctgaacgcc3' 5B 
(SEQ ID NO: 13) 
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5'cttcggcagtgggttgataatggc3' 3B 
(SEQ ID NO: 14) 

5'agggagatcgacagagatcatcgc3' 5C 
5 (SEQ ID NO: 15) 

5'tactgcgcgggcgctgctaaagcg3' 3A 
(SEQ ID NO:16) 

10 As a positive control, converging primers 3A and 5C, were 

used on Sbfl chromosomal DNA. For the inverse PCR, genomic 
DNA was individually digested with BsfBI, BstUI, Dral, Hindi, 
HpyCH4l\Z, Rsal and Seal. The digestions were inactivated at 
65°C for 20 min. Self-ligation was set up at a low DNA 

15 concentration at 2 mg/ml overnight at 17°C. The resulting circular 

DNA preps were used as the templates for inverse PCR. PCR 
conditions were 95°C for 5 min, 1 cycle; 95°C for 1 min, 62°C for 
1 min, 72°C for 2 min for 25 cycles. Converging primers 3A and 
5C generated the sbfIR ~400 bp control fragment. Inverse PCR 

20 products were found in the Hindi and A/pyCH4IV templates. The 

PCR products were gel-purified, phenol/CH 3 CI-extracted and 
isopropanol-precipitated. Immediately downstream of the 3B 
primer within the sbfIR N-terminus is an Apol site. This Apol site 
was used to digest the inverse PCR products at this site, 

25 followed by overnight ligation into fcoRI- and Hindi- digested 

pUC19. The ligated DNA was transferred into ER2502 and 
ER2683 by transformation. Plasmids were identified that 
contained the inverse PCR fragment and sequenced directly with 
pUC19 universal primers 1233 and 1224. Using this DNA 

30 sequence, another direct PCR was done with 3B and a newly 

designed converging primer Sb-3 having the following sequence: 
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5'gcggcaaccttcatccgg3' Sb-3 
(SEQ ID NO: 17) 

5 PCR conditions were 95°C for 5 min, 1 cycle; 95°C for 1 

min, 54°C for 1 min, 72°C for 2 min for 25 cycles. A PCR fragment 
containing ~1200 bp of the C-terminal end of sbfIR and 
downstream DNA was purified by phenol/CH3CI -extraction and 
isopropanol precipitation. The PCR fragment was digested with 

10 Apol and ligated into EcoRI- and Hindi- digested pUC19. The 

ligated DNA was transformed into ER2683 and plasmid DNA was 
purified. Plasmids containing the PCR fragment were sequenced 
with pUC19 universal primers 1233 and 1224. After cloning, then 
sequencing, an ORF with 969 bp long was found. This gene was 

15 judged likely to be the sbfIR restriction gene coding for Sbfl 

endonuclease as predicted by N-terminal R.Sbfl amino acid 
protein sequence. An additional ~700 bp of sequenced Sbfl 
chromosomal DNA that was downstream of sbfIR did not appear 
to contain the Sbfl methylase gene. 



20 



6. PCR amplification of DNA upstream of Sbfl 
endonuclease 



After failing to identify the sbflM gene downstream of the 
25 endonuclease gene, efforts were made to clone adjacent DNA 

upstream to sbfIR. A PCR reaction was done on the original 
pUC19 primary libraries of Sau3AI, Bell, BstYl, BgUl, Xbal and 
Nhel to check for the presence of sbfIR using converging primers 
5C and 3A: 



30 
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5'agggagatcgacagagatcatcgc3' 5C 
(SEQ ID NO:18) 

5'tactgcgcgggcgctgctaaagcg3' 3A 
(SEQ ID NO: 19) 

PCR conditions were 95°C for 5 min, 1 cycle; 95°C for 1 
min, 62°C for 1 min, 72°C for 2 min for 25 cycles. All libraries, 
except the Xbal library, contained the ~400 bp sbfIR fragment 
that should be generated by these two primers from sbfIR DNA 
sequence. Two separate PCR reactions were done with pUC19 
universal primers 1233 and 1224 with primer 5B, in the direction 
of 5' end of sbfIR, to determine if some of these pUC19 libraries 
might contain DNA upstream of the sequence sbfIR, perhaps 
containing the Sbfl methylase gene. The libraries tested were 
the Bg/II, Bell and Sau3AI pUC19 libraries. The PCR primers have 
the following sequence: 

5'agcggataacaatttcacacagga3' 1233 pUC19 Universal primer 
(SEQIDNO:20) 

5'cgccagggttttcccagtcacgac3' 1224 pUC19 Universal primer 
(SEQ ID NO:21) 

5'ccagtccatgatcttctgaacgcc3' 5B 
(SEQ ID NO:22) 

PCR conditions were 95°C for 5 min, 1 cycle; 95°C for 1 
min, 62°C for 1 min, 72°C for 8 min for 25 cycles. The 1233 and 
5B PCR primers generated a predominant fragment of ~1.6 kb 
for Bg/II and Sau3Al libraries; and ~4.0 kb fragment from the Bell 
library, both perhaps large enough to contain the whole Sbfl 
methylase gene (sbflM). The PCR products were gel-purified, 



Docket No.: NEB-232 

-27- 

phenol-CH 3 CI extracted and isopropanol precipitated, followed 
by direct sequencing of the PCR products. Only the PCR fragment 
from the Sbfl Sa</3AI-pUC19 partial library generated DNA 
sequence sufficiently readable to design new PCR primers for a 
direct PCR of the upstream region in its entirety. The Sau3AI 
library DNA sequence upstream of the sbfIR gene was used as 
the template for primer design and a new converging primer 5B- 
2 was made from sbfIR DNA sequence toward the upstream DNA 
sequence. 

The primers were synthesized with the following 
sequences: 

S'tggggcgaattcccagtccatgatcttctgaacgccS' 5B-2 
(underlined nt, EcoRI site) (SEQ ID NO: 23) 

5'tggggcaaflcttgatcaggtccgtg3' S3-1 

(underlined nt, Hindlll site) (SEQ ID NO:24) 

5'tggggcaagcttcgcctgctggttgacc3' S3-2 

(underlined nt, Hindll! site) (SEQ ID NO: 25) 

S'tgtggggcaagcttcgccccggtcgtccS' S3-3 

(underlined nt, Hindlll site) (SEQ ID NO: 26) 

S'tggggc aagcttc tgcgatccgctgccS' S3-4 

(underlined nt, Hindlll site) (SEQ ID NO:27) 

5 'tg gg g c a a g cttc g ttg gcg g tg c tcccg c3 ' S3-5 

(underlined nt, Hindlll site) (SEQ ID NO: 28) 

PCR conditions were 95°C for 5 min, 1 cycle; 95°C for 1 
min, 54°C for 1 min, 72°C for 1 min for 25 cycles. PCR fragments 
of ~1550 bp were found with both 5B-2 +S3-2 or 5B-2+S3-3 
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primers on Sbfl chromosomal DNA. The PCR fragments were gel- 
purified, phenol-CH 3 CI extracted and isopropanol precipitated, 
followed by EcoRI and Hindlll digestion. The digested PCR DNA 
was heated to 65°C for 15 min and ligated overnight at 17°C 
into EcoRI- and Hindlll- digested pUC19. The ligated DNA was 
transformed into ER2744 and colony PCR was done on 10 
colonies of each with universal pUC19 primers 1233 and 1224. 
PCR conditions were 94°C for 1 min, 1 cycle; 94°C for 10 sec, 
62°C for 1 min, 72°C for 1 min for 25 cycles. Most colonies 
contained a PCR fragment approximately 1500 bp. Plasmid DNA 
was purified for 6 clones, 3 for each PCR fragment, by CsCI 
method and then sequenced using primers 1233 and 1224. DNA 
sequence was only obtained from clone #5 which contained the 
PCR fragment generated by primers 5B-2 + S3-3. Clone #5 
contained a Hindlll to EcoRI fragment of ~1200 bp, slightly less 
then the original PCR fragment. The entire fragment was 
sequenced with seven additional primers. The sequencing 
primers have the following sequences: 

5'gagcaatgtcacagcgctacggac3' (51) 
(SEQ ID NO:29) 

5'gatccaacacagtcgagttcaacc3' (52) 
(SEQ ID NO:30) 

5'aacggcaaacggcgaaagaggacc3' (53) 
(SEQ ID NO:31) 

5 'cctcatgctttg g ttg a a ctcg a c3 ' ( 54 ) 
(SEQ ID NO:32) 

5'tcgttgctttcagtgtgaggccgc3' (55) 
(SEQ ID NO:33) 
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5'cgttgtgcccatggtttatcagtc3' (56) 
(SEQ ID NO:34) 

5 S'ctcttgctttaccttcgtgtccgcS' (57) 

(SEQ ID NO:35) 

After sequencing about 1.2 kb, a truncated ORF of 1040 bp 
long was found upstream of the sbfIR gene. This sequence most 

10 likely encoded the 3' end of the Sbfl methylase gene. When the 

amino acid sequence of this ORF was compared to the known 
gene products in GenBank using BLAST, it showed very high 
homology to A/6-methyl adenine methyltransferases especially 
those belonging to the PstI recognition family. The sbflM 

15 transcriptional (TGA) stop codon was found to overlap the (GTG) 

start codon for the Sbfl endonulease gene (sbfIR). DNA 
sequencing across the junction revealed that along with the GTG 
start, an additional amino acid, serine (S) is present making 
R.Sbfl one amino acid longer than predicted from N-terminal 

20 amino acid. The original incorrect sequence is (MN)SDGIDGTVAS 

IDTARALLKRFGFDAQRYNV (SEQ ID NO:36). In fact R.Sbfl is 
actually 323 amino acids in length, not 322 amino acids, and the 
N-terminal sequence is: MNSSDGIDGTVASIDTARALLKRFGFDAQ 
RYNV (SEQ ID NO:37). With the additional Serine, the full Sbfl 

25 endonuclease gene (sbfIR) is 971 bp. DNA sequencing of #5 also 

revealed an unknown Hindlll site within sbflM which shortened 
the original 5B-2 + S3-3 PCR fragment DNA between the S3-3 
primer to this Hindlll site was lost during cloning. 
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7. Inverse PCR amplification of DNA upstream of Sbfl 
endonuclease and identification of Sbfl methylase 

After identification of the truncated methylase gene, efforts 
were made to clone adjacent DNA encoding the 5' end of sbflM 
using inverse PCR. Using this sbflM DNA sequence new primers 
were designed having the following sequence: 

5'ggccacgtaaacgttcggtacatc3' (Al ) 
(SEQIDNO:38) 

5'tcatttcgctcaaagagcaggggc3' (Bl) 
(SEQ ID NO:39) 

The genomic DNA was digested with Hindi in appropriate 
restriction buffer and inactivated at 65°C for 20 min. Self-ligation 
was set up at a low DNA concentration at 2 mg/ml overnight at 
17°C. The circular DNA product was used as the template for 
inverse PCR. PCR conditions were 94°C for 5 min, 1 cycle; 94°C 
for 30 sec, 62°C for 1 min, 72°C for 1 min for 25 cycles. The PCR 
fragment was gel purified from an agarose gel, phenol/CH 3 CI- 
extracted and isopropanol precipitated. The resuspended PCR 
fragment was blunt-end ligated at 17°C overnight into pCAB16 
digested at the BsaAI site followed by tranformation into ER2502 
E. coli cells. Plasmid DNA was purified from twelve colonies. Ten 
appeared to contain the PCR DNA fragment. Four clones were 
sequenced directly with the following primers: 

5'ggagccatacagagagcgatttattcg3' 167 
(SEQ ID NO:40) 

5'ttgaaatcgaattaataagtctggatg3' 168 
(SEQ ID NO:41) 
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DNA sequencing identified an open reading frame (ORF) of 
495 bp DNA fragment containing the 5' end of sbflM. The 
combined DNA sequence upstream of the sbfIR gene revealed 
5 the total Sbfl methylase gene (sbflM). The sbflM gene is 1460 bp 

which encodes a translated Sbfl methylase of 486 amino acids. 
Transcription of M and R genes is oriented in the same direction. 
They are arranged in head-to-tail fashion (Figure 1). 

10 EXAMPLE 2 

Expression of SbfIR gene in E.coli 

Since the preferred methylase selection method failed to 
15 yield a Sbfl methylase gene, efforts were made initially to clone 

the sbfIR gene in a M.Pstl pre-modified E. coli strain in order to 
establish an sbfIR clone with the correct DNA sequence. The 
pstlM gene was first amplified from pBEA-14 plasmid containing 
the PstI methylase (provided by Bill Jack and Lucia Greenough, 
20 New England Biolabs (Beverly, MA)). The pstlM gene was cloned 

into a low copy number plasmid pACYC184 with pl5A origin and 
Cm R selection marker. The PCR primers have the following 
sequences: 

25 S'ttcc gggatcc ggaggtttaaaatatgactaagcggcaacacaattacctatatctcB 7 
(PstSM, underlined nt, BamHl site) (SEQ ID NO:42) 

5 'ttg ccgcatgcg teg tta ca tta g ctg ca a a ctctg a ttg a tta tttc3 ' 
(Pst3M, underlined nt, Sphl site) (SEQ ID NO:43) 
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PCR conditions were 95°C for 5 min, 1 cycle; 95°C for 1 
min, 54°C for 1 min, 72°C 1 min for 25 cycles with Deep Vent® 
DNA polymerase. The PCR product was phenol-CH 3 CI extracted 
and isopropanol precipitated, followed by BamHI and SphI 
digestion overnight at 37°C. Digested DNA was ligated to 
pACYC184 with compatible ends. Following ligation overnight, 
the DNA was transferred into ER2744 by transformation. After 
screening 10 plasmids isolated from individual Cm R (33 mg/ml) 
transformants, one clone pACYC184-PstIM #4 contained the 
correct pstIM fragment as shown by BamHI and SphI digestion. 
ER2744 [pACYC184-PstIM #4] was made competent by CaCI 2 
method. The premodified host ER2744 [pACYCl 84- PstIM] was 
used for establishing of the Sbfl endonuclease. 

Two PCR primers were synthesized for PCR amplification of 
the sbfIR gene. 

5'tggctgcagggatccggaggtttaaaatatgaacagcagtgacggcatcgacggaa 
cggtagc3' 

(sbf5R-2, underlined nt, BamHI site) (SEQ ID NO:44) 

5 'teg g g ccccg g gctcgag tcta a eg ttcg ta eg g ccca a g a a a tc t a g a cc 3 ' 
(sbf3RT7, underlined nt, Xhol site) (SEQ ID NO:45) 

PCR conditions were 95°C 5 min, 1 cycle; 95°C for 1 min, 
54°C for 1 min, 72°C for 1 min for 25 cycles with Deep Vent® 
DNA polymerase. PCR DNA containing the sbfIR gene was 
amplified from genomic DNA and purified by phenol/ CH 3 CI - 
extraction and CH 3 CI extraction, precipitated with isopropanol, 
dried and resuspended in TE buffer. The PCR DNA was blunt-end 
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ligated to pCAB16 at the BsaM. site. The ligated DNA was 
transformed into M.Pstl pre-modified E.coli ER2744 [pACYC184- 
PstlM] and Ap R Cm R transformants were selected at 37°C [Ap R 
Cm R :(100 mg/ml) and (33 mg/ml]. After screening 12 plasmids, 4 
5 clones were found to contain inserts. 500 ml LB + Amp cultures 

of clones #1 and #7 were grown overnight at 37°C. Plasmid 
DNA was purified from 450 ml of each cell culture by the CsCI 
method, and the other 50 ml of the cells of each were harvested 
by centrifugation and resuspended in 2 ml sonication buffer (10 

10 mM Tris-HCI, pH 8, 0.1 mM EDTA, 50 mM NaCI, 1 mM B- 

mercaptoethanol). Cells were lysed by sonication and cell debris 
removed by centrifugation. Cell lysate was assayed on I DNA for 
Sbfl activity. No Sbfl activity was detected for either pCAB16- 
SbflR #1 or #7 clones. Both pCAB16-SbfIR clones were 

15 sequenced directly with the following primers: 

5'ggagccatacagagagcgatttattcg3' 167 
(SEQ ID NO:46) 

20 5'ttgaaatcgaattaataagtctggatg3' 168 

(SEQ ID NO:47) 

DNA sequence showed an intact sbfIR in the opposite 
orientation to Plac and mspIR. The pCAB16-sbfIR #1 and #7 

25 clones contained correct DNA sequence for sbfIR, so an attempt 

was made to subclone the Sbfl endonuclease gene from 
pCAB16-sbfIR #1 and #7 into pLT7K, and then transform into 
ER2744 [pACYC184-PstIM]. Using the flanking BamHI and Xhol 
sites designed within the PCR primers, 10 ug of each pCAB16- 

30 sbfIR plasmid was digested with BamHI and Xhol at 37°C for 2 
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hours and the sbflR fragment were gel purified from a agarose 
gel, phenol-CH 3 CI extracted and isopropanol precipitated. The 
resuspended PCR fragment was was ligated at 17°C overnight 
into pLT7K with compatible ends, followed by transformation into 
5 ER2744 [pACYC184-PstIM] and plated on Ap R Cm R plates at 

37°C. Plasmid DIMA's were purified from 18 colonies (9 each from 
gel pure pCAB16-sbfIR #1 and #7, respectfully), 6 clones were 
found to carry the PCR insert. pLT7K-SbfIR #5, #12 and #14 
were inoculated into pre-warmed 10 ml cultures containing 

10 LB+Ap R Cm R and grown at 37°C overnight without shaking. 2 ml 

of the overnight cultures were diluted in pre-warmed 50 ml 
cultures containing LB+Ap R Cm R and grown at 37°C to an OD590 
of between 0.8 and 1.0, IPTG was to added to 85 mg/L and 
induced at 30°C for ~2 hours. Cells were harvested and lysed by 

15 sonication. Clarified cell lysates were assayed for Sbfl activity on 

I DNA. The extracts generated partial Sbfl digestion pattern. 
pLT7K-SbfIR #12 was sequenced with the following primers: 

5'tactgcgcgggcgctgctaaagcg3' 3A 
20 (SEQ ID NO:48) 

5'aatttctgctctcgcctgccgggc3' 3C 
(SEQ ID NO:49) 

25 5'ccagtccatgatcttctgaacgcc3' 5B 

(SEQ ID NO:50) 

5'agggagatcgacagagatcatcgc3' 5C 
(SEQIDNO:51) 

The DNA sequence of pLT7K-SbfIR #12 showed that it 
carried an intact sbflR gene. Shortly after this result, the Sbfl 
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methylase gene (sbflM) was completely identified by inverse PCR 
and sequenced. A final strategy was employed in which the 
sbflM gene was expressed from a low-copy-number plasmid and 
the endonuclease gene from pLT7K-SbfIR #12 was then 
transferred into this Sbfl pre-modified E. coli host. The sbflM 
gene was first amplified from genomic DNA in a PCR reaction and 
was cloned into a low copy number plasmid pACYC184 with pl5A 
origin and Cm R selection marker. The PCR primers have the 
following sequences: 

S'tggccgcigatccggaggtttaaaatatgcatccgatcgccagcactgaaactcg 
ccgc3' 

(sbfSM, underlined nt, BamHl site) (SEQ ID NO:52) 

S'ttgccgcatgcctcacgaggcagattccggaatctcacagaagagtcS' 
(sbf3M, underlined nt, Sphl site) (SEQ ID NO:53) 

PCR conditions were 95°C 5 min, 1 cycle; 95°C for 1 min, 
54°C for 1 min, 72°C for 1 min for 30 cycles with Deep Vent® 
DNA polymerase. PCR DNA containing the sbflM gene was 
amplified from genomic DNA and purified by phenol/CH3CI - 
extraction, precipitated with isopropanol, dried and resuspended 
in TE buffer. The PCR DNA was blunt-end ligated to pCAB16 at 
the BsaAI site. The ligated DNA was transformed into ER2502 
and Cm R transformants were selected at 37°C. After screening 
12 plasmids, two pCAB16-sbfIM clones, #3 and #5 were 
completely sequenced with the following primers: 

5'ggagccatacagagagcgatttattcg3' 167 
(SEQ ID NO:54) 

5'ttgaaatcgaattaataagtctggatg3' 168 
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(SEQ ID NO:55) 

5'ctttcccgtacttacaccgatgcc3' sbf-Ml 
(SEQ ID NO:56) 

5'tcctcgatcttgggcgaacgggcc3' sbf-M2 
(SEQ ID NO: 57) 

DNA sequence showed an intact sbflM in both orientations 
to Plac and mspIR. The P CAB16-sbfIR #3 and #5 clones 
contained correct DNA sequence for the sbflM gene, so an 
attempt was made to subclone the Sbfl methylase gene from 
P CAB16-sbfIR #3 and #7 into the low copy number plasmid 
PACYC184 with pl5A origin and Cm R selection marker. Using the 
flanking BamHl and Sphl sites designed within the PCR primers, 
5 ug of each pCAB16-SbfIM plasmid was digested with BamHI 
and Sphl at 37°C for 2 hours and the sbflM fragment were gel 
purified from a agarose gel and combined, phenol-CH 3 CI 
extracted and isopropanol precipitated. The re-suspended PCR 
fragment was was ligated at 17°C overnight into pACYC184 with 
compatible ends, followed by transformation into ER2848 and 
then selected for Cm R transformants. Plasmid DNA was purified 
from 8 colonies and PCR was done on two clones, #6 and #7, 
with primers sbf5M and sbf3M. PCR conditions were 95°C 5 min, 
1 cycle; 95°C for 1 min, 54°C for 1 min, 72°C for 1 min for 25 
cycles with Deep Vent® DNA polymerase. pACYC184-SbfIM #6 
and #7 contained the correct size insert for sbflM. 

The plasmid P ACYC184-SbfIM #7 was transferred into 
ER2848 to premodify E. coli. Competent cells were made by CaCI 2 
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method and the final strategy was employed in which the sbflM 
gene was expressed from a low-copy-number plasmid 
[pACYC184-SbfIM] and the endonuclease gene from [pLT7K- 
SbflR]. Isolate pl_T7K-SbfIR #12 was transferred into ER2848 
[pACYC184-SbfIM] and plated on Ap R Cm R plates at 37°C 
overnight. Two individual colonies were inoculated into 10 ml 
LB+Ap R Cm R and grown at 37°C overnight. 1 ml of each overnight 
culture was inoculated into 50 ml of LB+Ap R Cm R and grown at 
37°C to OD590 0.8 to 1.0, then the culture temperature was 
then lowered to 30°C, followed by IPTG (85 mg/L) induction at 
30°C for 2 hours to overnight. Both individual clones expressed 
R.Sbfl at more ~10 5 u/g per gram of wet E. coli cells (Figure 8). 

The strain NEB#1500, ER2848 [pACYC184-SbfIM #7, 
pLT7K-SbfIR #12 has been deposited under the terms and 
conditions of the Budapest Treaty with the American Type 
Culture Collection on August 5, 2003 and received ATCC 
Accession No. PTA-5371. 



